FIOOOResearch 2014, 3:121 Last updated: 02 JUL 2014 



RESEARCH ARTICLE 




CrossMark 

^- click for updates 



Yeast Augmented Network Analysis (YANA): a new systems 
approach to identify therapeutic targets for human genetic 
diseases [v1; ref status: indexed, http://f1000r.es/3gk] 

David J. Wiley 1 , Nona Juan 1 , Hao Le 2 , Xiaodong Cai 3 , Lisa Baumbach 4 , 
Christine Beattie 2 , Gennaro D'Urso 1 

1 Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL, 33136, USA 
department of Neuroscience, Ohio State University, Columbus, OH, 43210, USA 
department of Engineering, University of Miami, Miami, FL, 33124, USA 
integrated Functional Cancer Genomics, TGEN, Phoenix, AZ, 85004, USA 



v -j First published: 02 Jun 2014, 3:121 (doi: 10.12688/f1000research.4188.1) 
Latest published: 02 Jun 201 4,3:121 (doi: 1 0.1 2688/f 1 000research.41 88.1 ) 

Abstract 

Genetic interaction networks that underlie most human diseases are highly 
complex and poorly defined. Better-defined networks will allow identification of 
a greater number of therapeutic targets. 

Here we introduce our Yeast Augmented Network Analysis (YANA) approach 
and test it with the X-linked spinal muscular atrophy (SMA) disease gene 
UBA1. First, we express UBA1 and a mutant variant in fission yeast and use 
high-throughput methods to identify fission yeast genetic modifiers of UBA1. 
Second, we analyze available protein-protein interaction network databases in 
both fission yeast and human to construct UBA1 genetic networks. Third, from 
these networks we identified potential therapeutic targets for SMA. Finally, we 
validate one of these targets in a vertebrate (zebrafish) SMA model. This study 
demonstrates the power of combining synthetic and chemical genetics with a 
simple model system to identify human disease gene networks that can be 
exploited for treating human diseases. 
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Introduction 

Many disease-associated genes have been identified, yet most genetic 
diseases remain untreatable. One path to treatment is to develop 
extensive genetic networks in which human disease genes function 
(or dysfunction) and then target therapies to the genes identified 
in those networks. Eventually, an individual's own genotype for 
proteins in the network may also be considered in the therapeutic 
options. Developing gene networks is already widely recognized as 
a powerful approach to identify new drug targets 16 . Gene networks 
are based on the principle that networks contain proteins that inter- 
act (physically and/or functionally) and that these interactions gov- 
ern most, if not all, cellular functions. Importantly, gene network 
interactions are often conserved in different organisms even though 
the output of the networks may differ 7 . Candidate networks can be 
generated from studies in model organisms and then extrapolated to 
human cells. One can expect that, for disease gene networks, some 
of the interacting genes might modulate the disease phenotype thus 
representing potential therapeutic targets. 

The direct identification of disease networks from studies in human 
cells is, of course, challenging because of the large number of poten- 
tial proteins and incomplete knowledge of how the activity of any 
one protein may affect the network output. Analysis of networks 
would benefit from studies in a tractable model organism amenable 
to high throughput methods. Given that network interactions may 
be conserved between humans and model organisms, even while 
the outputs may differ, we hypothesized that we can identify human 
disease networks in the fission yeast Schizosaccharomyces pombe, 
a simple, genetically tractable model organism, coupled to existing 
protein-protein interaction databases. We can then transfer that net- 
work knowledge to human cells with the goal of identifying thera- 
peutic target genes. 

How valid is the approach of using fission yeast networks to iden- 
tify human disease gene networks? First, the proteins that control 
most core cellular functions are in fact evolutionarily conserved, 
underscoring the "deep homology" that exists between all living 
organisms 8 9 . For example, in S. pombe >65% of the genome is 
orthologous to human (http://orthomcl.org/orthomcl/ and http://www. 
pombase.org). Second, many of the cellular processes implicated in 
human disease, e.g. vesicular transport, protein folding, metabolism, 
and RNA processing, are also evolutionarily conserved and highly 
interconnected 10 . Importantly, the value of a simple model organism 
for discovering "druggable" genetic pathways has recently been dem- 
onstrated for the budding yeast Saccharomyces cerevisiae 11 , and thus 
extending the study of human disease genes to fission yeast seems 
promising. 

Our approach (Figure 1) involves the expression of human disease 
associated genes in S. pombe and then the analysis of their effect on 
yeast fitness (growth). We performed high- throughput synthetic genetic 
array (SGA) screens to identify the fission yeast genetic modifiers 
that alter this effect, as measured by a simple yeast growth assay. 
The genetic modifiers are then assembled into human disease gene 
networks or clusters (using protein-protein interaction datasets from 
both S. pombe and humans). Any modifiers or genes in the networks 
or clusters represent potential therapeutic targets. This unbiased 
high-throughput approach could be widely and rapidly applied to 
many different disease-associated genes at relatively low cost. 



To demonstrate the power of YANA, we considered spinal muscu- 
lar atrophy, a common neurodegenerative disease affecting approxi- 
mately 1/6000 births worldwide and the number one genetic cause 
of infantile death in the United States 12 . Most cases (>90%) are 
caused by deletion of SMN1, a gene encoding the survival motor 
neuron (SMN), a protein involved in the assembly of spliceosomal 
small nuclear ribonuceoproteins (snRNPs) 13 . In contrast, X-linked 
spinal muscular atrophy is caused by mutations in UBA1, a gene 
encoding the ubiquitin activating (UBA) enzyme 1, an El ubiquitin 
ligase 1415 . Consistent with a role of ubiquitination in spinal muscu- 
lar atrophy and SMN biology, it was recently shown that ubiquitin- 
mediated proteolysis regulates SMN stability 1617 . Therefore, we 
systematically investigated the genetic network for UBA1 to better 
understand the potential connections between UBA1, its modifiers 
and the spinal muscular atrophy phenotype. Using YANA we iden- 
tified several potential therapeutic targets and validated one of these 
targets in a SMA vertebrate (zebrafish) model. 

Methods 

Query Strain Creation 

UBA1 was cloned into a pENTR/D-TOPO vector (Life Technolo- 
gies, Cat # K2400-20) from cDNA (Origene, Cat. # SC320329) fol- 
lowing the manufacturer's protocols. The following primers were 
used: forward primer: 5'- CACCATGTCCAGCTCGCCGC-3'; 
reverse primer: 5'- TC AGCGG ATGGTGTATCGG AC- 3 ' . Genetic 
insertion was confirmed by sequencing (http://sylvester.org/shared- 
resources/oncogenomics). UBA1 (G1617T), mutUBAl, was creat- 
ed by site directed mutagenesis using a QuikChange Lightning Site- 
Directed Mutagenesis kit (Agilent Technologies, Cat. # 210518; the 
detailed protocol is available in the kit). The primers used were: 
forward primer: 5'-GCAGCTGTGCGCCAAATTAATCCACAT- 
ATCCGG-3'; reverse primer: 5 '-CCGGATATGTGGATTAATTTG- 
GCGCAC AGCTGC-3 ' . LR Gateway reactions (Gateway Cloning 
Protocols: http://www.lifetechnologies.com/us/en/home/life-science/ 
cloning/gateway-cloning/protocols. html#lr) were then performed 
to flip the UBA1 genes into destination vectors, to create N-termi- 
nal HA-tagged UBA1 under the control of the nmtl promoter (LR 
Clonase II from Life Technologies, Cat # 11791020). The newly 
generated expression vectors were then integrated into an k leul-32 
ura4-D18Ade6-M210 S. pombe strain (PN572) to create a wtUBAl 
query strain (k integrated pjkl48-nmtl 3X -HA-UBAl-nmtl term leul- 
32 ura4-D18 Ade6-M210) and a mutUBAl query strain (k integrated 
pjkl48-nmtl 3X -HA-UBAl(G1617T)-nmtl tem leul-32 ura4-D18 Ade6- 
M210). All media, growth conditions, and genetic manipulations 
were as previously described 18 . 

Western Blot and Growth Curves 

Strains containing wtUBAl and mutUBAl were grown exponen- 
tially in PMG media (Sunrise Scientific Cat. #2060, keeping the 
OD 595 below 0.4) or eight generations at 32°C and then induced for 
expression of the HA-tagged wtUBAl and mutUBAl by washing 
the cells three times with sterile water to remove thiamine. Cells 
were then grown exponentially for 16 hours and then lysed using 
a FastPrep 120 bead beater (MP Biomedical), followed by boiling 
in sample buffer (2x Laemmli Sample Buffer, Bio-Rad Cat. #161- 
0737) and then clarified by centrifugation. The expression of UBA1 
was confirmed by Western Blot analysis using an anti-HA antibody 
at 1:2000 dilution (Covance, Cat #MMS-101P, AB_10063488) and 
standard procedures 19 . Growth curve analysis was completed in the 
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Figure 1 . YANA analysis of UBA1 genes. Numbers illustrate various steps in our analysis. (1 ) Construction of query strains by integrating 
wildtype and mutant variants of the UBA1 gene into pombe. (2) Crossing the query strains (mutant or wildtype) to a complete non-essential 
deletion collection. (3) Bioinformatic analysis to generate the top 'gene hits'. (4) Validation of the top hits in yeast. (5) Confirmation of the top 
hits in zebrafish. 



presence and absence of thiamine (final concentration, 15 uM; 
Sigma, Cat. #T4625). Removal of thiamine was achieved by wash- 
ing the cells three times with sterile water. Cells were incubated 
in the absence of thiamine for 22 hours under exponential growth, 
diluted and 120 ul added to the Tecan Infinity F200 plate readers 
(starting OD 595 of 0.05) for growth curve analysis with an n=5 for 
each sample. For validation experiments, cul3 and gsk3 were not 
pre-induced, while publ was pre-induced for 22 hours. The dele- 
tions of cul3, publ, and gsk3 were confirmed by PCR analysis of 
genomic DNA from each strain, as previously described 20 . 

SGA screening 

Query strains were grown in liquid media and then pinned to agar in 
a 384-format using a RoToR HDA (Singer Instruments). The query 
was then crossed to the S. pombe haploid deletion library (Bioneer, 
Version 3.0 equivalent) on SPAS media (details can be found at 
http://www-bcf.usc.edu/~forsburg/media.html) using a modified 
SGA procedure 21 . For germination, four replicates of each cross 
were pinned to a 1536 format on selective PMG media containing 
thiamine, adenine (225 mg/L, Sigma Cat. #A8751), leucine (225 mg/L, 
Sigma Cat. #L8912), and the antibiotic G418 (150 uM, Gold Bio 



Cat. #G-1418) (PAUT+G418). After three days, the plates were 
then pinned to both PAUT+G418 (non-inducing) and PAU+G418 
(inducing) plates. The colonies growing on the PAUT+G418 plates 
were documented on a flatbed scanner for the next three consecu- 
tive days. The PAU+G418 plates were grown for two days, and then 
re-pinned to fresh PAU+G418 plates and documented over the next 
three consecutive days. Based on the growth characteristics of the 
wiUBAl strain, the plates were then pinned to fresh PAU+G418 
plates and documented for an additional three days. 

Hit analysis 

The documented plates were analyzed for 'Hits' representing a 
growth defect (SL) or growth suppressor (SS) using ScreenMill 
software 22 . The software is used to quantify colony size for each 
individual cross and then to normalize the quantified plates with 
and without replicate exclusion for each quadruplicate of the query 
crossed to a specific deletion strain. Data was then compared 
(between non-induced versus induced) and ranked in Excel (P<0.05) 
(Microsoft). The orthologs were then identified based on curated 
data from PomBase (www.pombase.org; build 2013-1 1-1 1-vl), 
OrthoMCL (http://orthomcl.org/orthomcl/; Version 5), InParanoid8 
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(http://inparanoid.sbc.su.se/cgi-bin/index.cgi; Version 8.0), and 
Homologene (http://www.ncbi.nlm.nih.gov/homologene; build 67). 

Bio-Informatics 

The S. pombe and human 'Hits' were analyzed in String (www.string- 
db.org) to map protein-protein interactions limited to data from 
experiments at the highest confidence level (0.900), and named 
pombe primary and human primary. The S. pombe primary was then 
extended by adding first degree neighbors to the original pombe 
'Hits', keeping all interactions based on data from experiments at 
the highest confidence level (0.900). From the extended pombe net- 
work interaction map, new 'Hits' were extracted that corresponded 
to at least two previous 'orphans' ('Hits' that were not previously 
mapped) that interact through a nearest neighbor. All 'Hits' were 
then analyzed through PubMed (http://www.ncbi.nlm.nih.gov/pub- 
med/) for relevance in terms of SMA, with special focus on the 
'Hits' in the interaction maps. 

Morpholino (MO) injection and drug test 

Transgenic Tg(mnxl :0.6hsp70:GFP)os26 embryos that express GFP 
in ventrally projecting motor axons, referred to as Tg(mnxl:GFP) 
embryos, were used for all zebrafish experiments. Embryos were 
staged according to Fritz et alP All fish were grown and maintained 
in the Ohio State University (OSU) zebrafish facility following estab- 
lished protocols and OSU animal welfare guidelines as stated in 
Dr. Beattie's animal protocol (On file at OSU: 2009A0141-R1). 
Tg(mnxl:GFP) embryos were injected with 4 ng smn MO at the 
one- to 2-cell stage to knock down Smn as previously described 24 . 
At 10 hours post-fertilization (hpf) injected embryos (in their cho- 
rions) were placed in Petri dishes in fish water (60 ug/ml Instant 
Ocean® sea salts) containing compounds in 0.75% dimethyl sul- 
foxide (DMSO) or 0.75% DMSO only and incubated at 28.5°C in 
incubator until 28 hpf. 

Tg(mnxl:GFP) embryos at 28 hpf were anesthetized with tricaine 
(250 ug/ml, Sigma A-5040) and fixed overnight at 4°C in 4% formal- 
dehyde/PBS. After removing embryos from fix, they were mounted 

A B 



WT Control wtUBAl mutUBAl 



+T -T +T -T +T -T 




UBA1 
«<— tubulin 



on glass coverslips for observation under a Zeiss Axioplan micro- 
scope. Motor axons innervating the mid- trunk (myotomes 6-15) on 
both sides of the fish were scored as described 25 . 

The control topped b458 embryos were treated and fixed as described 
above for Tg(mnxl:GFP) embryos; the topped mutants, however, 
were processed for znpl antibody (Hybridoma Bank Cat#znp-1, 
AB_531910) labeling as previously described 26 to visualize motor 
axons for scoring as they did not have the Tg(mnxl:GFP) in the 
background. 

Zebrafish western blot 

Samples of zebrafish embryos injected with 4 ng smn MO at the 
one- to two-cell stage and uninjected embryos were treated from 10 
to 28 hpf with DMSO or with DMSO and compounds as described 
above were collected for western blot. 

Samples were generated by boiling 25 identically treated embryos 
in 75 ul blending buffer (63 mM Tris (pH 6.8), 5 mM EDTA, 10% 
SDS). 10 ul (equivalent to 3 embryos or 75 ug of protein) were 
added to 10 ul of sample buffer (100 mM Tris (pH 6.8), 0.2% 
bromophenol blue, 20% glycerol, 200 mM dithiothreitol) and run 
on a 10% polyacrylamide gel, blotted to nitrocellulose, probed with 
mouse anti-Smn (1/500; MANSMA12, a gift from Dr. G.E. Morris 
or anti HuD (1/1000; Santa Cruz Cat#sc-28299, AB_627765) and 
detected by chemiluminescence of bound HRP-conjugated mouse 
antibody. Blots were stripped and re-probed with mouse anti- (3- 
actin (1/1000; Santa Cruz Cat #sc-47778, AB_626632). 

Results 

We expressed the human wildtype UBA1 (wtUBAl) and the dis- 
ease-causing variant \6\1UBA1 (mutUBAl) in S. pombe under the 
control of a regulatable nmtl promoter 27 . We identified strains with 
stable integrated human UBA1 genes expressing equivalent levels 
of UBA1 proteins (Figure 2A). Under non-inducing conditions 
(+ thiamine) all OTA 7 -containing strains had a similar growth rate 
to the control non-UBAl -containing wildtype strain (Figure 2B). We 




Time (Minutes) 

^WT Control +T— WT Control -T ^wtUBAl +T 
— wtUBAl -T ^mutUBAl +T —mutUBAl -T 



Figure 2. Construction of S. pombe strains integrated with human UBA1. Wildtype and mutant forms of UBA1 were integrated into fission 
yeast under the control of the nmtl thiamine-repressible promoter. (A) Western blot analysis of protein expression levels shows that both forms 
of UBA1 (wildtype and mutant) are expressed to equivalent levels. (B) Expression of wild-type UBA1 inhibits S. pombe cell growth. 
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then found that cells expressing human wtUBAI (fission yeast cells 
grown in media lacking thiamine to induce UBA1 expression), expe- 
rienced a decrease in cell growth. No growth defect was observed 
for yeast cells expressing mxxiUBAl when compared to the control 
(Figure 2B). 

We set out to identify yeast gene mutations that either enhance or 
suppress cell growth dependent upon expression of either wt or 
mxxiUBAl (i.e., epistatic modifiers). We used automated genetics to 
introduce the wt and mxxiUBAl genes into a deletion strain collection 



corresponding to more than 90% of the non-essential genes in 
S. pombe (ca. 80% of the complete genome). We then tested each 
of the >7000 unique strains expressing either wt or mxxiUBAl com- 
bined with a specific gene deletion, for growth properties. Each 
strain was scored for significance of growth difference compared 
to a control, and the top hits (P<0.05) were assembled into a data- 
base (Table 1). The categories of growth phenotypes were those 
that were synthetic lethal for wildtype or mutant (wt-SL, or mut- 
SL) or synthetically suppressed for wildtype or mutant (wt-SS, or 
mut-SS). We identified 173 UBA1 or mutUBAl modifiers. Notably, 



Table 1. S. pombe SGA top hits. The tops hits for wtUBAI SL, wtUBAI SS, mutUBAl SL, 
and mutUBAl SS were identified by bioinformatic analysis as described in the materials and 
methods section. 





wtUBAI SL 


wtUBAI SS 


mutUBAl SL 


mutUBAl SS 


1 


SPAC24H6.03 


SPBC365.06 


SPBC8D2.01 


SPAC11G7.02 


2 


SPAC1687.15 


SPAC11G7.02 


SPBC6B1.10 


SPCC13B11.01 


3 


SPAC3G9.07c 


SPCC31H1 2.05c 


SPBC216.05 


SPBC2G2.03C 


4 


SPBC6B1.10 


SPBC31F10.13C 


SPCC1 795.01c 


SPBC31F10.13C 


5 


SPAPYUG7.04C 


SPCC1620.14C 


SPBC24C6.11 


SPCC1620.14C 


6 


SPBC24C6.11 


SPAC57A10.02 


SPBC1 734.08 


SPAC3G6.02 


7 


SPBC1 734.08 


SPAC1556.01C 


SPAC9.13C 


SPAC1556.01C 


8 


SPAC23D3.09 


SPAC30D11.13 


SPAC23D3.09 


SPAC821.05 


9 


SPAC17A5.11 


SPCC970.07C 


SPAC57A10.02 


SPBC6B1.04 


10 


SPAC16A1 0.05c 


SPAC23H4.12 


SPAC16A1 0.05c 


SPCC1919.15 


11 


SPCP1E1 1.07c 


SPCC11E10.08 


SPCC895.07 


SPBC23E6.09 


12 


SPBC146.13C 


SPBC21D10.12 


SPCP31B10.05 


SPBC21D10.11C 


13 


SPAC821.09 


SPCC1 672.06c 


SPCC550.12 


SPAC22H10.07 


14 


SPBC23E6.09 


SPAC3C7.03C 


SPAC23H4.12 


SPCC338.08 


15 


SPCC1682.16 


SPBC32F12.02 


SPBC1 1C1 1 .02 


SPBP35G2.08C 


16 


SPBC106.01 


SPCC736.04C 


SPBC1289.11 


SPBC29A3.14C 


17 


SPCC622.16C 


SPBC30D10.13C 


SPCC1 26.02c 


SPAC29A4.18 


18 


SPAC22H10.07 


SPCC737.09C 


SPBC13G1.13 


SPBC1921.03C 


19 


SPBC13G1.13 


SPBC56F2.01 


SPCC188.13C 


SPBC2F12.11C 


20 


SPBC16H5.06 


SPBC2F12.11C 


SPCC663.12 


SPAPB1 E7.02c 


21 


SPBC29A10.05 


SPBC16A3.07C 


SPCC338.05C 


SPAC17A5.16 


22 


SPBC28F2.07 


SPAC5D6.05 


SPAC343.11C 


SPAC20H4.03C 


23 


SPCC1 827.08c 


SPAC18G6.02C 


SPCC1919.03C 


SPCC1 259.03 


24 


SPAC9G 1.10c 


SPAC19D5.01 


SPAC343.18 


SPAC18G6.02C 


25 


SPBC83.03C 


SPAC688.11 


SPCC737.09C 


SPAPB1 E7.06c 


26 


SPBC342.04 


SPAC227.07C 


SPBC106.16 


SPAC1851.04C 


27 


SPCC4G3.19 


SPAC1851.04C 


SPAC3A1 1.14c 


SPAC13G7.06 


28 


SPAC9G1.05 


SPBC16H5.07C 


SPAC589.02C 


SPBC725.11C 


29 


SPCC4E9.02 


SPAC17A5.18C 


SPCC1 450.05c 


SPCC1739.12 


30 


SPCC794.11C 


SPBC725.11C 


SPAC8F11.03 


SPCC663.01C 


31 


SPBC21 6.06c 


SPAC23C11.08 


SPAC13A1 1.04c 


SPBC21C3.02C 


32 


SPCC1 450.05c 


SPBC3B8.02 


SPAC1 952.07 


SPAC1 F7.09c 
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wtUBAI SL 


wtUBAI SS 


mutUBAI SL 


mutUBAI SS 


33 


SPAC22A12.16 


SPBC17D1.06 


SPAC14C4.03 


SPBC365.11 


34 


SPAC12B10.07 


SPAC11E3.06 


SPAC19G12.06C 


SPAC1 44.04c 


35 


SPAC4A8.05C 


SPCC663.01C 


SPAC2F7.04 


SPCC4F1 1.03c 


36 


SPAC1 B9.02c 


SPAC1 67.04 


SPAC1 006.09 


SPAC4G8.03C 


37 


SPAC16C9.05 


SPAC14C4.12C 


SPAC17G6.04C 


SPBP8B7.06 


38 


SPAC23H3.08C 


SPBC2A9.06C 


SPAC22H12.02 


SPCC1 223.05c 


39 


SPAC2F7.04 


SPAC24H6.13 


SPBC725.02 


SPAC1 556.05c 


40 


SPAC22H 12.02 


SPAPB1A10.03 


SPAC140.01 


SPBC16C6.03C 


41 


SPBC725.02 


SPAC12G12.03 


SPAC 1805.04 


SPBC2G2.07C 


42 


SPAC21E1 1.03c 


SPCC663.03 


SPBC1709.10C 


SPAC31G5.12C 


43 


SPBC12C2.02C 


SPBC1539.10 


SPBC577.15C 


SPAC4G9.15 


44 


SPBC21B10.13C 


SPCC1 223.05c 


SPAC 1420.03 


SPAC25B8.05 


45 


SPAC1687.13C 


SPCC31H12.04C 


SPBC29A1 0.03c 


SPAC1 071.07c 


46 


SPBC3F6.05 


SPCC1 739.08c 


SPAC4F10.04 




47 


SPBC4B4.03 


SPAC2C4.07C 


SPAPB1E7.12 




48 


SPAC222.16C 


SPAC1 142.08 


SPCC320.08 




49 


SPAC4F10.04 




SPCC830.07C 




50 


SPAC19B12.10 




SPAC4G9.05 




51 


SPBC2G2.02 




SPBC13G1 .14c 




52 


SPAC1 834.04 




SPCC24B1 0.08c 




53 


SPAPB1E7.12 




SPAC4H3.03C 




54 


SPCC320.08 




SPAC 1805.05 




55 


SPCC830.07C 








56 


SPAC4G9.05 









57 SPBC13G1.14C 



145 of the modifiers (83.8%) were orthologous to human genes, 
and thus are potential drug targets to modify the SMA phenotype. 

To generate our candidate human disease gene networks (that we 
shall call network clusters) we relied on two complementary ap- 
proaches. Both approaches rely on networks constructed from pub- 
lished protein-protein interaction data from either human (Figure 3 
and Figure S3) or fission yeast (Figure 4, Figure SI and Figure S2). 
By including two types of genetic interactions, synthetic lethals or 
synthetic suppressors, yeast augmented network analysis (YANA), 
yields the most complete unbiased list of potential modifiers. In a 
first approach, we converted all 145 S. pombe modifier genes to 
their corresponding human orthologs. All the data were combined 
to create network cluster diagrams. Individual datasets were deline- 
ated using a color key. (Note that we designate each interaction as a 
synthetic lethal or synthetic suppressor a priori, and include them 
both as it is difficult to predict which of these classes will have 
potential therapeutic value). Using the highest confidence (0.9) exper- 
imentally confirmed protein-protein interaction data (from www. 
string-db.org), we used cytoscape to draw network cluster modules 
that include all modifier genes. We identified 22 clusters using this 
approach (Figure S3). We then limited the number of networks by 
focusing on those modifiers that shared at least two interactions with 



other modifiers shrinking the number of clusters to 10 (Figure 3). 
Within our human primary network clusters, we identified several 
genes that represent compelling therapeutic targets. These includ- 
ed GSK3 (Glycogen Synthase Kinase-3), CUL3 (Cullin-3) and 
NEDD4 (E3 ubiquitin ligase) along with SUMOl (Small Ubiquitin 
Modifier- 1) and HDAC (Histon deacetylase), a known chromatin 
modifier. Interestingly, GSK3 inhibitors increase SMN levels in 
spinal muscular atrophy patient-derived fibroblasts and mouse 
motor neurons 28 . The HDAC inhibitor Trichostatin A increases 
SMN expression and survival in a mouse model of spinal muscular 
atrophy 29 . Therefore, evidence already exists that genes within our 
clusters affect SMA biology. 

In our second approach, we constructed networks from just the 
S. pombe modifier genes using high confidence S. pombe protein- 
protein interaction data from STRING. For this analysis, we identi- 
fied network clusters that were expanded to include all the so-called 
"first-degree (1°) interacting proteins", proteins not identified in our 
SGA screens but that physically interact with a genetically identi- 
fied modifier (Figure SI). The rationale underlying our approach 
to generate larger clusters comes from the facts that the 1° neigh- 
bor may be an essential gene (and therefore it is not present in our 
deletion library) or may fail to be detected in our screens for other, 



Page 6 of 19 



FIOOOResearch 2014, 3:121 Last updated: 02 JUL2014 





SHFMj i^l ^PP6C 



f |pP6R^ 



BUB1 




STAM 




NFYA 









STAM 

stam; 




Color 


WtUBAl SL 


WtUBAl SS 


mutUBAl SL 


mutUBAl SS 


Red 


X 








Green 










Orange 






X 




Light Green 








X 


Yellow 


X 




X 




Light Blue 










Silver 


X 






X 


Violet 




x 


x 




White 











Figure 3. Human UBA1 primary gene interaction network. Human orthologs of yeast genes identified from four independent SGA screens 
were used to map the corresponding protein-protein interaction network. Only proteins that have experimentally verified data (STRING) 
supporting a physical interaction with at least two additional genes/proteins identified in the UBA1-SGA screens are displayed. Of particular 
interest is the discovery of NEDD4, CUL3, GSK3, SUMO and HDAC in distinct network clusters. Proteins of interest are highlighted by a larger 
node size and indicated with an arrow. 
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Figure 4. S. pombe extended UBA1 gene interaction network (extracted from Figure S1 on the basis of identifying new primary hit 
interactions through a 1° neighbor). Primary yeast gene hits from all four SGA screens were combined and mapped based on highest 
confidence experimentally confirmed protein-protein interaction data. For this extended analysis, we also included first-degree neighbors of 
modifiers that were not directly identified in the SGA screens. Only selected gene clusters are shown here; the complete extended network is 
shown in Figure S1 . When extrapolated to human, the yeast extended network analysis identified Csn5 and Cul3, two modifiers also identified 
in our human UBA1 primary gene interaction network (boxed; see also Figure 3). 
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unknown reasons. This "Cluster with 1° neighbor" analysis reveals 
new interactions not observed in the yeast primary UBA1 gene 
interaction network (Figure S2). A few examples of these types of 
clusters are shown in Figure 4. The clusters include several genes 
all of which were uniquely identified in the wildtype screen, includ- 
ing both synthetic lethals and synthetic suppressors. Of particular 
interest is the cluster of proteins surrounding Skpl(S-phase kinase 
associated protein- 1), a protein that interacts and stablilizes F-box 
proteins, additional F-box proteins, and proteins involved in cul- 
lin deneddylation and neddylation. In addition, we found that the 
RNA-binding protein Cwf2 interacts directly with three of our 
modifiers, and the plastin ortholog Fiml, interacts with two. Inter- 
estingly, Plastin 3 (PLS3) has been identified as a disease modifier 
in animal models of spinal muscular atrophy 30 32 . 

We then identified correlations between the extended S. pombe 
network clusters (Figure 4) and the human primary network clus- 
ters (Figure 3). In both networks, we identified components of the 
COPS complex, a known regulator of E3 cullins, and the E3 cul- 
lin Cul3. Note that Cul3 was not present in our primary S. pombe 
network (Figure S2) yet was present in the more extended network 



clusters that incorporate 1° neighbors; this illustrates the impor- 
tance of adding 1° neighbors to our analysis of the S. pombe data. 

Together, these data suggest a prominent role for E3 ubiquitin ligases 
in modulating UBA1 and potentially SMN1, making Cul3 the 
primary target for further analysis. The E3 ubiquitin ligase might 
inhibit SMN function, as ubiquitination often leads to protein deg- 
radation. If so, we reasoned that inhibition of E3 ubiquitin ligase 
might enhance SMN activity and thus suppress loss of function 
mutations in SMN1. To directly test the hypothesis that inhibition 
of E3 ubiquitin ligases can suppress a SMN1 -spinal muscular 
atrophy phenotype, we turned to a vertebrate model of SMN defi- 
ciency previously reported in zebrafish 33 . Knockdown of Smn in 
zebrafish embryos causes developmental defects in motor neuron 
axonal outgrowth that include truncations and abnormal branch- 
ing of neurons 24 . Motor neuron axon defects can be corrected by 
injection of mRNAs encoding wildtype human SMN 25 . We first 
confirmed that injection of smn morpholino (MO) into wildtype 
zebrafish causes severe motor axon abnormalities as compared to 
control uninjected embryos (Figure 5 A and 5B). We then tested 
whether an inhibitor of E3 ubiquitin ligases (MLN4294) would 
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Figure 5. MLN4294 rescues the abnormal neuron outgrowth in Smn-depleted zebrafish. (A) Representative lateral views of motor axons 
in Tg(mnx1:GFP) zebrafish embryos expressing GFP in motor neurons and injected with control MO and then grown in 10 or 15 uM MLN4294. 
(B) Quantification of the effects of MLN4294 on motor axon development in zebrafish. Motor axons were scored in Tg(mnx1 :GFP) embryos 
injected with control MO, and subsequently (10 hrs post-injection) incubated in 10 or 15 uM MLN4294. Embryos were classified as severe, 
moderate, mild, or no defects based on the severity of motor axon defects, and the percentage of each group is shown. Data in all graphs 
are represented as mean and SEM. (C) Western blot analysis of Smn and HuD protein following treatment with MLN4294. Quantification of 
the results is shown in Figure S5. 
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suppress the motor axon abnormalities. Addition of the Nedd8-El 
activating enzyme inhibitor MLN4294 (that blocks cullin-RING E3 
ligases), at concentrations ranging from 10 uM to 15 uM, caused 
a concentration-dependent reduction in the degree of abnormal 
motor axon branching (Figure 5 A and 5B). At 15 uM inhibitor 
we found motor axon abnormalities that were completely sup- 
pressed: defects were not significantly different to those observed in 



uninjected embryos (P=0.379). At higher concentrations of drug 
(20 uM), the rescue was less pronounced (Table 2) probably because 
higher levels of drug led to defects in development. Thus, E3 ligase 
inhibition can rescue neuronal defects caused by Smn protein deple- 
tion in zebrafish. In control experiments, MLN4294 failed to rescue 
the zebrafish mutant topped (Figure 6), a mutant defective in neuronal 
axon guidance 34 . We also tested a drug that inhibits sumolyation 



Table 2. Complete dataset for zebrafish experiments. 
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Figure 6. Treatment of zebrafish with MLN4294 fails to rescue defects in ventral motor axon guidance in the topped mutant. (A) 

Quantification of the effects of MLN4294 on motor axon guidance in the zebrafish topped mutant. Motor axons were scored in embryos 
injected with DMSO, 10 or 15 uM MLN4294. Embryos were classified as severe, moderate, mild, or no defects based on the severity of motor 
axon defects, and the percentage of each group is shown. Data in all graphs are represented as mean and SEM. (B) Experimental data. 
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corresponding to an unrelated target (SUMO) identified in our human 
primary network clusters (Figure 3); addition of this compound had 
no effect on the spinal muscular atrophy model (Figure S6) neces- 
sarily surprising since pmt3, which encodes the yeast ortholog of 
SUMOl, has the opposite effect of cul3 when deleted. Having been 
identified as a synthetic suppressor rather than a synthetic lethal, 
perhaps compounds that activate rather than inhibit sumolyation 
would be beneficial as a therapeutic. 

To determine how E3 ubiquitin ligases might influence Smnl func- 
tion, we examined the protein levels for both Smn and HuD. HuD is 
a known SMN interacting protein that binds RNAs controlling their 
translation and stability and functions in neural development and 
plasticity 34-36 . We observed greater than a 2-fold increase in both 
Smn and Hud protein levels following treatment with MLN4294, a 
NEDD8 activating enzyme (NAE) inhibitor that prevents activation 
of E3 ligases (Figure 5C and Figure S5). It is therefore possible that 
rescue of the neuronal defects in our zebrafish model might involve 
stabilization of Smn as well as of additional proteins within the 
Smn network. 

r a 

Growth rate data of expression of UBA1 in S. pombe 

1 Data File 

http://dx.doi.Org/1 0.5256/f 1 000research.41 88.d28505 

UBA1 was expressed in S. pombe under inducing (- thiamine) and 
non-inducing conditions (+ thiamine) in wildtype strains (wtUBAI) 
and the disease-causing variant wildtype 1617UBA1 (mutUBAI), 
under the control of a regulatable nmtl promoter. Growth rate data 
of expression of UBA1 under inducing (- thiamine) and non-inducing 
conditions (+ thiamine) in wtUBAI and mutUBAI are shown. For 
each condition there are six replicates. 

V J 

Discussion 

In this paper we introduce YANA, a fission yeast genetic assay that 
that when applied to a specific human disease gene can leverage 
protein-protein interaction data to characterize human disease net- 
works. From these networks, we can identify and prioritize genetic 
pathways likely to modify the disease-associated gene activity, and 
predict those genes that can be exploited as therapeutic targets. Using 
UBA1, a gene associated with X-linked spinal muscular atrophy 
(XL-SMA), we found a high degree of homology between our yeast 
modifiers and their corresponding human genes (>80%), underscor- 
ing the extensibility of the assay. Specifically, we identified several 
network clusters and high priority targets for therapeutic intervention, 



including GSK3 and HDAC both of which are potential therapeu- 
tic targets for SMA. In addition, we found two novel related tar- 
gets Cul3 (a cullin required for E3 ubiquitin ligase activity) and 
NEDD4, an E3 ubiquitin ligase. 

Our yeast system, with its small but complex eukaryotic genome, 
and complete deletion library, is unique in allowing unbiased genome- 
wide screening of deletions that alter human disease gene activity. 
Moreover, YANA can be applied to any human gene, regardless of 
the phenotype or availability of endogenous mutations. The number 
of candidate genes identified by YANA for UBA1 represents a frac- 
tion of the approximately 30,000 genes in the human genome, pro- 
viding a significant enrichment of potential modifiers. Therefore, 
YANA offers a simple, cost-effective, and relatively rapid technol- 
ogy that could be applied to all human genetic diseases. 

Data availability 

FIOOOResearch: Dataset 1. Growth rate data of expression of UBA1 
in S. pombe, 10.5256/fl000research.4188.d28505 37 . 
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Figure S2. S. pombe UBA1 primary gene interaction network. Fission yeast epistatic modifiers from all four SGA datasets were combined 
and used to map a protein-protein interaction network using the highest confidence protein interaction data (STRING-db). 
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Figure S3. Complete human UBA1 primary gene interaction network representing 22 clusters. Human orthologs of yeast genes identified 
from four independent SGA screens were used to map the corresponding protein-protein interaction network. 
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Figure S4. Validation of SGA data for cul3,pub1 and gsk3. Growth curves of the 3 double mutants confirmed our original SGA data showing 
that the deletions of cul3 and gsk3 are synthetically lethal with OE of UBA1, and the deletion of publ {NEDD4) suppresses the toxicity 
associated with OE of wildtype UBA1. For gsk3 and cul3, the cells were washed to remove thiamine and immediately placed in the plate 
reader whilst for publ, cells were first pre-induced for 22 hrs. Deletions of cul3, publ, and gsk3 were confirmed by PCR. 
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Figure S5. Expression of Smn and HuD in zebrafish embryos. Quantification of western blot results shown in Figure 5C. Treatment of 
zebrafish embryos with MLN4294 results in approximately a 3-fold increase in both Smn and HuD levels 18 hrs following treatment. 
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Figure S6. Zebrafish raw data. A) MLN4924 treatment in smn MO zebrafish. B) MLN4924 treatment in topped mutant zebrafish. C) Anacardic 
acid treatment in smn MO zebrafish. 
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The Wiley etal manuscript describes a beautiful synthesis of contemporary genetic approaches to, with 
astonishing efficiency, identify lead compounds for therapeutic approaches to a serious human disease. I 
believe the importance of this paper stems from the applicability of the approach to the several thousand 
of rare human disease genes that Next-Gen sequencing will uncover in the next few years and the 
challenge we will have in figuring out the function of these genes and their resulting defects. This work 
presents a paradigm that can be broadly and usefully applied. 

In detail, the authors begin with gene responsible for X-linked spinal muscular atrophy and express both 
the wild-type version of that human gene as well as a mutant form of that gene in S. pombe. The 
conceptual leap here is that progress in genetics is driven by phenotype, and this approach involving a 
yeast with no spine or muscles to atrophy is nevertheless and N-dimensional detector of phenotype. 

The study is not without a small measure of luck in that expression of the wild-type UBA1 gene caused a 
slow growth phenotype which the mutant did not. Hence there was something in S. pombe that could feel 
the impact of this protein. Given this phenotype, the authors then went to work and using the power of the 
synthetic genetic array approach pioneered by Boone and colleagues made a systematic set of double 
mutants combining the human expressed UBA1 gene with knockout alleles of a plurality of S. pombe 
genes. They found well over a hundred mutations that either enhanced or suppressed the growth defect 
of the cells expressing UBI1 . Most of these have human orthologs. My hunch is that many human genes 
expressed in yeast will have some comparably exploitable phenotype, and time will tell. 

Building on the interaction networks of S. pombe genes already established, augmenting these networks 
by the protein interaction networks from yeast and from human proteome studies involving these genes, 
and from the structure of the emerging networks, the authors deduced that an E3 ligase modulated UBA1 
and made the leap that it therefore might also impact X-linked Spinal Muscular Atrophy. 

Here, the awesome power of the model organism community comes into the picture as there is a 
zebrafish model of spinal muscular atrophy. The principle of phenologs articulated by the Marcotte group 
inspire the recognition of the transitive logic of how phenotypes in one organism relate to phenotypes in 
another. With this zebrafish model, they were able to confirm that an inhibitor of E3 ligases and of the 
Nedd8-E1 activating suppressed the motor axon anomalies, as predicted by the effect of mutations in S. 
pombe on the phenotypes of the UBA1 overexpression. 
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I believe this is an important paper to teach in intro graduate courses as it illustrates beautifully how 
important it is to know about and embrace the many new sources of systematic genetic information and 
apply them broadly. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
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This paper addresses the vexing problem of how to unravel the complexity of human diseases. Such 
diseases often involve interactions between genes, and the question is therefore well-suited to an 
automated genetic approach, such as Synthetic Genetic Array technology, which was pioneered in 
bakers yeast and more recently adopted to another well-characterized model system, namely S. pombe. 
In this study the authors examine conserved genetic interactions between a fission yeast genes and a 
human query gene (both wildtype and loss-of-function mutants) then combine their experimental data with 
bioinformatics to characterize the network of genetic interactions and protein-protein interactions. They 
then leverage this network to identify interacting partners that comprise novel therapeutic targets. 

The example presented in this paper is the human UBA1 gene which encodes a gene that, when 
mutated, results in human X-linked spinal muscular atrophy disease. They perform whole genome genetic 
interaction mapping for digenetic interactions with both wild type and mutant versions of the gene, build a 
network with the observed interactions, and then expand that primary network by adding the known 
partners that interact with members of the primary network. They then turned to the zebrafish vertebrate 
model system to validate their newly identified therapeutic targets. 

This study adds another tool to the armamentarium of systems biology approaches with which the 
complexity of human diseases can be dissected. Because the system is new in the sense that fission 
yeast has been largely unexplored (compared to its budding yeast relative from which it has diverged by 
several hundred million years) the observations on fission yeast, when combined with those from other 
model systems, provide an attractive complementary approach to understanding complex human biology. 

Given that this study builds on earlier approaches, the authors should cite the previously published 
studies on genetic interactions, chemical genetic interactions and protein-protein interactions using 
budding yeast, fly and other models. Similarly, the comprehensive gene expression profiling approaches 
used in cultured mammalian cells as well as more recent whole genome RNA interference and 
CRISPR-based disruption screens should be cited. 

From a methodological perspective, in the interaction maps presented in Figures 3 and 4 it would be 
useful to describe or cite the methods used to define the links between genes. In the Results, where the 
authors describe the results of the different screens which include wild type and mutant human protein 
interactions as well as primary and secondary interactions it would be instructive to provide a graphic or 
flow chart that depicts how all the key pieces of data from each independent screen in fission yeast (and 
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their analysis) were collapsed and presented together. It would also be instructive to provide background 
information or appropriate citations on the two compounds used in the zebrafish experiments, specifically 
their known or suspected mechanisms of action as well as the data to support these mechanisms. 

In the Discussion, the authors have room to put the YANA screening platform into context, comparing it to 
other orthologous genome-wide approaches. 

A key innovation in this study arises from the unique approach used in analyzing the data. Specifically, the 
conserved yeast genes that modify the phenotype induced by expressing a human gene of interest are 
assembled into human disease gene networks which are then augmented by additional protein-protein 
interaction data. Augmenting traditional network analysis in this manner may permit the identification of 
new therapeutic targets and target pathways. In summary, this comprehensive, genome-wide work 
represents an important addition to the systems biology toolkit to understand the complexities of human 
biology and the impact of this and subsequent data sets will surely increase as additional screens are 
added to the compendium. 

Minor points 

• Western blot and growth curve: "or eight generations" is a typo, also the type of media the cells 
were grown in "exponentially for 16 hours" should be listed. 

• In Figure 2 it is not clear which band the tubulin arrow is pointing to- the loading control bands 
should be indicated directly on the blot/gel. Also, it is apparent from this loading control that the 
amount of UPA1 expressed is extremely high, the authors should comment on the fact that the 
overexpressing lanes are actually UNDERIoaded relative to the non-expressing lanes. 

• Third paragraph of the Results: the software programme cytoscape should include a reference. 

• Third paragraph of the Results: histone is misspelled "histon". 

• Sixth paragraph of the Results: the authors should clearly distinguish between the published data 
in zebrafish and their new data collected in this study 

• Sixth paragraph of the Results: as mentioned be inhibitor's mechanism of action should be 
described in more detail 

• Second paragraph of the discussion: approximately 30,000 genes in the human genome should be 
cited and probably corrected to the conventional number of 21 ,000. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: I was involved in some of the preliminary discussions with the authors about this 
work in the early phases when these studies were being conceived. However, I can confirm that I was not 
involved in carrying out any of the research nor in the writing of the final article. 
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